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Abstract — MDS array codes are widely used in storage systems 
to protect data against erasures. We address the rebuilding ratio 
problem, namely, in the case of erasures, what is the the fraction 
of the remaining information that needs to be accessed in order 
to rebuild exactly the lost information? It is clear that when the 
number of erasures equals the maximum number of erasures 
that an MDS code can correct then the rebuilding ratio is 1 
(access all the remaining information). However, the interesting 
(and more practical) case is when the number of erasures is 
smaller than the erasure correcting capability of the code. For 
example, consider an MDS code that can correct two erasures: 
What is the smallest amount of information that one needs to 
access in order to correct a single erasure? Previous work showed 
that the rebuilding ratio is bounded between j and |, however, 
the exact value was left as an open problem. In this paper, we 
solve this open problem and prove that for the case of a single 
erasure with a 2-erasure correcting code, the rebuilding ratio is 
|. In general, we construct a new family of r-erasure correcting 
MDS array codes that has optimal rebuilding ratio of j in the 
case of a single erasure. Our array codes have efficient encoding 
and decoding algorithms (for the case r = 2 they use a finite field 
of size 3) and an optimal update property. 

I. Introduction 

Erasure-correcting codes are the basis of the ubiquitous 
RAID schemes for storage systems, where disks correspond 
to symbols in the code. Specifically, RAID schemes are 
based on MDS (maximum distance separable) array codes that 
enable optimal storage and efficient encoding and decoding 
algorithms. With r redundancy symbols, an MDS code is 
able to reconstruct the original information if no more than r 
symbols are erased. An array code is a two dimensional array, 
where each column corresponds to a symbol in the code and 
is stored in a disk in the RAID scheme. We are going to refer 
to a disk/symbol as a node or a column interchangeably, and 
an entry in the array as an element. Examples of MDS array 
codes are EVENODD IT], JU, B-code 13], X-code g), RDP 
ID, and STAR-code |]6|. 

Suppose that some nodes are erased in an MDS array code, 
we will rebuild them by accessing (reading) some information 
in the surviving nodes. The fraction of the accessed informa- 
tion in the surviving nodes is called the rebuilding ratio. If r 
nodes are erased, then the rebuilding ratio is 1 since we need to 
read all the remaining information. However, is it possible to 
lower this ratio for less than r erasures? For example. Figure 
[T] shows the rebuilding of the first systematic (information) 
node for an MDS code with 4 information elements and 
2 redundancy nodes, which requires the transmission of 3 
elements. Thus the rebuilding ratio is 1/2. 

In Q, El, a related problem is discussed: the nodes are 
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Figure 1. Rebuilding of a (4,2) MDS array code over F3. Assume the first 
node (column) is erased. 

assumed to be distributed and fully connected in a network, 
and the repair bandwidth is defined as the minimum amount 
of data needed to transmit in the network in order to retain the 
MDS property. Note that one block of data transmitted can be 
a function of several blocks of data. In addition, retaining MDS 
property does not imply rebuilding the original erased node, 
whereas we restrict our problem to exact rebuilding. Therefore, 
the repair bandwidth is a lower bound of the rebuilding ratio. 

An (n, fc) MDS code has n nodes in each codeword and 
contains k nodes of information and r = n — k nodes of 
redundancy. A lower bound for the repair bandwidth was 
shown as I7j 

k n — k 

where is the total amount of information. It can be verified 
that Figure [T] matches this lower bound. A number of works 
addressed the repair bandwidth problem llTl- llT6l . and it was 
shown by interference alignment in ifTZl . lfT3l that this bound 
is asymptotically achievable for exact repair. Instead of trying 
to construct MDS codes that can be easily rebuilt, a different 
approach ifTTl . ifTsl was used by trying to find ways to rebuild 
existing famiUes of MDS array codes. The ratio of rebuilding 
a single systematic node was shown to be | + o(l) for 
EVENODD or RDP codes. However, from the lower bound 
of O the ratio is 1/2. 

The main contribution of this paper is the first explicit 
construction of systematic {n,k) MDS array codes for any 
constant r = n — k, which achieves optimal rebuilding ratio 
of J. We call them intersecting zigzag sets codes (IZS codes). 
The parity symbols are constructed by linear combinations 
of a set of information symbols, such that each information 
symbol is contained exactly once in each parity node. These 
codes have a variety of advantages: 1) they are systematic 
codes, and it is easy to retrieve information; 2) they are have 
high code rate k/n, which is commonly required in storage 
systems; 3) the encoding and decoding of the codes can be 



easily implemented (for r = 2, the code uses finite field of 
size 3); 4) they match the lower bound ([T]i when rebuilding 
a systematic node; 5) the rebuilding of a failed node requires 
simple computation and access to only 1/ {n — k) of the data 
in each node (no linear combination of data); and 6) they 
have optimal update, namely, when an information element is 
updated, only n — k + 1 elements in the array need update. 

The remainder of the paper is organized as follows. Section 
In] provides definitions and background on MDS array codes. 
Section Hill presents the new constructions of {k + 2,k) MDS 
array codes with an optimal rebuilding ratio. Section |IV] 
introduces the concept of code duplication that enables the 
constructions of {k + 2, k) MDS array codes for an arbitrary 
number of columns. We discuss the size of the finite field 
needed for these constructions in Section |V] Decoding algo- 
rithms for erasures and errors are discussed in Section ED 
Section IVIII provides generalizations of our MDS code con- 
structions to an arbitrary number of parity columns. Finally, 
we provide concluding remarks in Section IVIIII 

II. Definitions and Problem Settings 

In the rest of the paper, we are going to use [i,j] to denote 
{i, i + 1, . . . , j} for integers / < j. And denote the complement 
of a subset X C M as X = M\X. For a matrix A, denotes 
the transpose of A. 

Let A = {cii^j) be an array of size p xk over a finite field F, 
each entry of which is an information element. We add to the 
array two parity columns and obtain an {n = k + 2,k) MDS 
code of array size p x n. Each element in these parity columns 
is a linear combination of elements from A. More specifically. 



let the two parity columns be Q 



[0,p-l], r, 



and 



Q+l = (zo/Zi---/Zp_i)^. Then for / e 
EagR, aflfl and z; = EaeZ, f^a^, for some subsets R/, Z/ of 
elements in A, and some coefficients {an}, {jSa} C F. We 
will call R = {Ro,Ri,...,Rp_i} and Z = {Zo,Zi,...,Zp_i} 
the sets that generate the parity columns. 

Since the code is a (A: + 2, k) MDS code, each information 
element should appear at least once in each parity column 
Q, Q+l- We will assume that each information element in 
A appears exactly once in each parity column, which implies 
optimal update for the code. As a result, we have the following 
theorem. 

Theorem 1 The sets R (or Z ) are partitions of A into p equally 
sized sets of size k. 

Proof: Each set X G R does not contain two entries of A 
from the same column. W.l.o.g. assume X contains two entries 
of A from the first column, then we can not rebuild these 
two elements when the first column and the parity column 
Q+i are erased. Thus X contains at most one entry from 
each column and then |X| < k. However each element of 
A appears exactly once in each parity column, thus if there 
is |X| < fc, X e R, there is Y e R, with \Y\ > k, which 
leads to a contradiction. Therefore, \X\ = k for all X G R. 
As each information element appear exactly once in the first 
parity column, R = {Rq, ■ ■ .,Rp_i} are partitions of A into 








1 


2 


R 


Z 





* 










1 












2 






<> 






3 





9 










Figure 2. Pemiutations for zigzag sets in a (5,3) code witti 4 rows. The 
permutations for rows are the identity permutations. The shaded elements are 
accessed to rebuild column I. 



p equally sized sets of size k. Similar proof holds for the sets 
2 = {2^0/- ■ -/Zp^i}. ■ 

By the above theorem, for the j-th systematic column 
[uq, . . . fUp^l)''^, its p elements are contained in p distinct 
sets R;, / G [0, p — 1]. In other words, the membership of the 
j-th column's elements in the sets {R/} defines a permutation 
gj : [0,p -1] [0,p - 1], such that gj{i) = 1 iff e R/. 
Similarly, we can define a permutation fj corresponding to 
the second parity column, where fj{i) = I iff G Z;. For 
example. Figure |2] shows a (5,3) code. Each element in the 
parity column Z is a linear combination of elements with the 
same symbol. And each systematic column corresponds to a 
permutation of the four symbols. 

Observing that there is no importance of the elements' 
ordering in each column, w.l.o.g. we can assume that the first 
parity column contains the sum of each row of A and gj's 
correspond to identity permutations, i.e. r, = J^^Zq ocj^jUi^j. 
We refer to the first and second parity columns as the row 
column and the zigzag column respectively, likewise R/ and 
Z;, / G [0, p — 1], are referred to as row sets and zigzag sets 
respectively. Call fj, j G [0, fc — 1], zigzag permutations. By 
assuming that the first parity column contains the row sums, 
the code is uniquely defined by (i) the zigzag permutations, 
and (ii) the coefficients in the linear combinations. 

Our approach consists of two steps; first we choose the 
appropriate zigzag sets Zq, ...Zp_i in order to minimize the 
rebuilding ratio, and then we choose the coefficients in the 
linear combinations in order to make sure that the constructed 
code is indeed a (A: + 2, k) MDS code. But first we show that 



for any set of zigzag sets Z = {Zg, Zp_i} there exists 
(fc + 2, k) MDS array code over a field F large enough. For that 
proof we use the well known Combinatorial Nullstellensatz by 
Alon mi: 



Theorem 2 (Combinatorial Nullstellensatz) HI Th 1.2] Let F 
be an arbitrary field, and let f = f{xi, Xq) be a polynomial 
in F[xi, ...,Xq]. Suppose the degree off is deg(/) = EJ^i 
where each tj is a nonnegative integer, and suppose the coeffi- 
cient of HfLi / nonzero. Then, if Si, Sn are subsets 
ofF with \Si\ > tj, there are Si G Si,S2 G S2, G Sq so 
that 

f{si,...,Sq) ^ 0. 

Theorems Let A = {ui^j) be an array of size p xk and the 
zigzag sets be Z = {Zq, Zp_i}, the/i there exists a (fc + 2, fc) 
MDS array code for A with Z as its zigzag sets over the field F 
of size greater than p(A: — 1) + 1. 



Proof: Assume the information of A is given in a 
column vector W of length pk, where column i of A is 
in the row set [(f — l)p + 1, fp] of W. Each systematic 
node f, i G [0,k— 1], can be represented as Q/W where 

Qi = [OpxpiJpxp>Opxp{k-i^l)]- Moreover define = 

[lpxp,Ipxp,--,Ipxp],Qk+l = [xoPo,XlPl, ■.■,X]^^iPj^_i] 

where the P,'s are permutation matrices of size p x p, and 
the x,'s are indeterminates, such that Q = Qj^W, Q+i = 
Qs:+lW. The permutation matrix P; = (pj'j,) is defined 

as pI'^j = 1 if and only if fl„, , G Z; and the P,'s are 
not necessarily distinct. If there exists such MDS code it is 
equivalent to the existence of a set of values for {x;} such that 
for any set of integers {si,S2, Sjt} C [0, fc+ 1] the matrix 
Q = [QJj, QJj, Q^] is of full rank. It is easy to see that if 
the parity column Q+i is erased i.e., k + 1 i {si,S2, Sj.} 
then Q is of full rank. If fc ^ {si,S2, ...,sj;} and A: + 1 G 
{si,S2, ■■.,Sq} then Q is of full rank if none of the x,'s equals 
to zero. The last case is when both k,k + l G {si,S2, ...,sj;}, 
i.e., there are < z < / < A: — 1 such that i {si,S2, ...,sj;}. 
It is easy to see that in that case Q is of full rank if and only 
if the submatrix 



x,P, 



'-pxp 



XjPj 
^pxp 



is of full rank. This is equivalent to det(B;y) ^ 0. Note that 
deg(det(By)) = p and the coefficient of x^ is det(P,) G 
{1,-1}. Define the polynomial 



T = T{xo,xi,...,x,,^i) 



n det(By), 

0</<;<J:-l 



then the result follows if and only if there is an assignment 
flO,fli,--, fl)c-l G F such that r(flO/fli/ ■•■/flfc-i) 7^ 0. T 
is of degree p(2) and the coefficient of YI^Iq x^^'^ 



IS 

n'^rg det(P,)''~' ^ 0. Set for any i,Si = F\0 in Theorem 
|2] then the result follows. ■ 
The above theorem states that there exist coefficients such 
that the code is MDS, and thus we will focus on finding 
proper permutations {/y} first. The idea behind choosing 
the zigzag sets is as follows: assume a systematic column 
{aQ,ai, ...,ap_lY is erased. Each element fl, is contained in 
exactly one row set and one zigzag set. For rebuilding of 
element access the parity of its row set or zigzag set. 
Moreover access the values of the remaining elements in that 
set, except a,. We say that an element fl, is rebuilt by a row 
(zigzag) if the parity of its row set (zigzag set) is accessed. For 
example, in Figure |2] supposing column 1 is erased, one can 
access the shaded elements and rebuild its first two elements 
by rows, and the rest by zigzags. 

In order to minimize the number of accesses to rebuild the 
node, we need to minimize the size of 



U*^^^ S-l 



(2) 



where each S/ G R U Z is either a row set or a zigzag set 
containing a,-. Each |S, | = k, therefore in order to minimize 
the size of the union we will try to maximize the number 



of intersections between the sets {Si}\z}. We say that S = 
(So, Si, Si;_i) rebuilds (fli,fl2, f'p) ■ For the rebuilding 
of node / by S = (Sq, Si, Sj;_i), define the number of 
intersections by l{i\S) = pk — \ U^^Zq Sj\. Moreover define 
the number of total intersections in an MDS array code with 
zigzag sets Z as 



(=0 



S rebuilds i 



Now define h{k) to be the maximum possible intersections 
over all (fc + 2, fc) MDS array codes, i.e.. 



max liZ) 

z 



In Figure 12] the rebuilding set is S = {Rq,Ri,Zq,Zi}, the 
size of (|2|i is 8 and /(1|S) =4. Note that each surviving node 
accesses exactly ^ of its information without performing any 
calculation within it. The following lemma gives a recursive 
bound for the number of intersections. 

Theorem 4 Letq <k<p thenh{k) < ''^^^^^j^j'^^ 

Proof: Let A be an information array of size p x k. 
Construct an MDS array code C such that the first parity is 
the row sums, and the second parity is defined by the zigzag 
sets Z. Suppose the zigzag sets Z are defined such that I{Z) 
is maximized. 

Let B = {bi^j) be a matrix of size (^) x k which is defined 
as follows: each row ; in the matrix corresponds to a subset of 
columns of size q of A, denoted by /. Denote by the subarray 
of A with columns in / as Aj. Define zigzag sets Zj as subsets 
of Z with only elements in columns /. For each Aj, construct 
an MDS array code Cj using Aj as information, row sum as 
the first parity, and Zj as the second parity. 

Each column in B corresponds to column / in A. Let bjj be 
the number of maximal intersections when rebuilding column 
i in the code Cj. U i i J then fey = 0. It is easy to see that 
the sum of each row is no more than h{q). 

On the other hand, the sum of the columns can be computed 
as follows, for the code C, denote by /(/, f|S) the number of 
intersections in column I for rebuilding column i using the 
rebuilding set S. Suppose S* rebuilds column / and maximizes 
/(z|S), then 

J2Lnii\St)=h{k). 

i l^i 

Similarly, for the code Cj, / 7^ z G /, denote by Jj(/, z|S) the 
number of intersections in column I for rebuilding column i 
using a rebuilding set S. Assume S*j rebuilds column i in Cj 
and maximizes E/^/gj z|S), then by definition 



E hili\Sl,) 



Note that S* also rebuilds column i in Cj, but may not be the 
maximum-achieving set: 

E hiii\si,) > E nii\s*). 



Combining the above equations, we get 

LLh' = EE E 

i j i } 

> EE E nu\s*) 
= EE E nu\s*) 

Thus the summation of the entire matrix B leads us to 



(12) 



q{q-l) 



and completes the proof. ■ 
For a fixed number of rows p define the rebuilding ratio 
for a {k + 2,k) MDS code C as 



R{C) 



Y^i^Q accesses to rebuild node i 



pk{k + l) 

which denotes the average fraction of accesses in the surviving 
array for rebuilding one systematic node. Define the ratio 
function for all {k + 2,k) MDS codes with p rows as 

which is the optimal average portion of the array needed to be 
accessed in order to rebuild one lost column. In this expression 
h{q) and pq in the numerator correspond to the number of 
elements we do not access in the systematic nodes and parity 
nodes, respectively. 

Lemma 5 R{k) is no less than j and is a monotone nonde- 
creasing function. 

Proof: From Theorem |4] for k = q + 1 we get h{q + 1) < 



R{q + l) = 1 
> 1 

We want to show 



Then we get 

h{q + l) 



p{q + l){q + 2) q + 2 

m \_ 

p{q-l){q + 2) q + 2 



or 



R{q + l)>R[q) 



h{q) + p{q-l) ^ h{q) + pq 



l){q + 2) - pq{q + l) 



which is equivalent to 



Kq)< 



pq{q-l) 



In the rebuilding of a node (fli, ••■/ ^p)^, for any element 
Uj, one element from the parity nodes is accessed. In total 
we access in the two parity nodes p elements out of 2p 
elements, i.e., exactly j of the information. Let x and p — x 
be the number of elements that are accessed from the first and 
second parity respectively. W.l.o.g we can assume that x > 
otherwise p — x would satisfy it. Each element of these x sets 
is a sum of a set of size q. Thus in order to rebuild the node. 



we need to access at least x{q — 1) > 



elements in 



(3) 



the q — 1 surviving systematic nodes, which is at least half of 
the size of the surviving systematic nodes. So the number of 
intersections is no more than £2ii— LI. Thus Q holds and the 
ratio function R{k) > j. ■ 

The lower bound of 1 /2 in the above theorem can be also 
derived from ([Til. We will see from Lemma [Tol that R{k) is 
almost 1 /2 for all k and p = 2\ where / is large enough. 

By Lemma |5] and ([T]i for any p and k, R{k) > For 
example, it can be verified that for the code in Figure |2] all the 
three systematic columns can be rebuilt by accessing half the 
remaining elements. Thus the code achieves the lower bound 
1/2, and therefore R(3) = 1/2. 

III. {k + 2, k) MDS ARRAY CODE CONSTRUCTIONS 

The previous section gave us a lower bound for the ratio 
function. The question is can we achieve it? If so, how? We 
know that each {k + 2, k) MDS array code with row and zigzag 
columns is defined by a set of permutations fo,---,fk-i and 
their subsets X;'s. The following construction constructs a 
family of such MDS array codes. From any set T C F™, 
|r| = k, we construct a (k + 2,k) MDS array code of 
size 2'" X (k + 2). The ratio of the constructed code will be 
proportional to the size of the union of the rebuilding set. 
Thus we will try to construct such permutations and subsets 
that minimize the union. We will show that some of these 
codes have the optimal ratio of j. 

In this section all the calculations are done over F2. By 
abuse of notation we use x G [0,2™ — 1] both to represent the 
integer and its binary representation. It will be clear from the 
context which meaning is in use. 

Construction 1 Let A = (ai^j) be an array of size 2'" x k for 
some integers k,m and k < 2'". Let T C Fj" be a subset 
of vectors of size k. For v ^ T we define the permutation 
fv : [0,2"' - 1] [0,2"^ - 1] byfv{x) = x + v, where x is 
represented in its binary representation. One can check that this 
is actually a permutation. For example when m = 2,v = (1, 0) 

/(i,o)(3) = (i,i) + (i,o) = (ai) = i, 

and the corresponding permutation of v is [2,3,0,1]. In addi- 
tion, we define X-o as the set of integers x in [0,2™ — 1] such 
that the inner product between their binary representation and 
V satisfies x ■ v = 0, e.g., Xf^-^ gj = {0, 1}. The construction of 
the two parity columns is as follows: The first parity column is 
simply the row sums. The zigzag sets Zq, Z2m_xare defined 
by the permutations {fy. : Vj E T} as fl/y G Z; if fy.(i) = I. 
We will denote the permutation fy. as fj. Assume column j 



R 



(4) 



needs to be rebuilt, and denote Sr = {dj^j : i G Xj} and 
Sz = {o-i j : i i Xy}. Rebuild the elements in Sr by rows 
and the elements in Sz by zigzags. 

Recall that by Theorem |3] this code can be an MDS code 
over a field large enough. The following theorem gives the 
ratio for Construction [T] 

Theorem 6 The code described in Construction [1] and gener- 
ated by the vectors Vq,Vi, ...,V]^_i is a (fc + l,k) MDS array 
code with ratio 

1 Lt^Ei^r\MXi)nfj{x,)\ 

2 2'«fc(fc + l) 

Proof: In rebuilding of node / we rebuild the elements 
in rows X, by rows, thus the first parity column Q accesses 
the values of the sum of rows X,. Moreover, each surviving 
systematic node accesses its elements in rows X,-. Hence, 
by now |X/|fc = 2"'^^k elements are accessed, and we 
manage to rebuild the elements of node i in rows X,. The 
elements of node i in rows X, are rebuilt by zigzag, thus 
the second parity column Q+i accesses the values of zigzags 
{zfj{l) : ' G X,} and each surviving systematic node accesses 
the elements of these zigzags from its column, unless these 
elements are already included in the rebuilding by rows. The 
zigzag elements in : I G X;} of node / are in rows 

fj'^{fi{Xj)), thus the extra elements node need to access 
are in rows f^^{fi(Xi))\Xi. But, 



|/;ri(/,(x:))\x,| = |/;ri(/,(XO)nX^| 

= \fi\m))r^n 

= l(/KX0)n/;(x,-)|, 

where we used the fact that //,/; are bijections, and |X, | = 
|X, | = 2™~^. Hence in rebuilding of node i the number of 
elements to be accessed is 2"'"l(fc + 1) + E^-^/ 1 (//(X,)) n 
fj{Xi)\. The result follows by dividing by the size of the 
remaining array 2'"(fc+ 1) and averaging over all systematic 
nodes. ■ 
The following lemma will help us to calculate the sum in 
but first we associate to any vector v = {v\, Vm) G F™ 
the subset of integers Bp C [m] where i G if Vi — l. 

Lemma? foranyv,u G T 



|/.(x„)n/„(x-,)| 



\X„\, \B^\Bu\ mod 2 = 
0, \Bjj\Bu\ nrod2 = l. 



(5) 



Proof: Consider the group (F^', +). Recall that /^(X) = 
X + v={x + v: xe X}. The sets fv{Xy) = Xy + v and 
fu{Xz,) = Xy + u are cosets of the subgroup Xy = {iv G 
F™ : w ■ V = 0}, and they are either identical or disjoint. 
Moreover, they are identical iff v — u G Xy, namely [v — 
u) ■ V = E,:i,;=i,i,;=o 1 = 0. However, \By\Bu\ mod 2 = 
Eru,=l,M,=0 1' '^he result follows. ■ 
This construction enables us to construct an MDS array code 
from any subset of vectors in F™. However, it is not clear 



which subset of vectors should be chosen. The following is 
an example of a code construction for a specific set of vectors. 

Example 1 Let T = {v e F^" : ||z;||i = 3} be the set of 
vectors with weight 3 and length m. Notice that \T\ = ('3). 
Construct the code C by T according to Construction [TJ Given 



e T, \{u e T : \By\Bu\ 



3}| 



(^g^), which 



is the number of vectors with 1 's in different positions as 



V. Similarly, \{u G T : |B,,\B„| 
\{u G T : |B,\B„| = 1}| = 3(m 
Lemma^ for large m the ratio is 

n 2'"-l(™)3('"2^) 



and 



= 2}| = 3r2 

3). By Theorem^ and 



2 ^ 2'" ("')(( 



_9_ 
2m' 



Note that this code reaches the lower bound of the ratio as 
m tends to infinity. In the following we will construct codes 
that reach the lower bound exactly. 

Let {/o/ •••//fc-i} be a set of permutations over the set 
[0,2™ - 1] with associated subsets Xq, X,,_i C [0,2'" - 1], 
where each |X,| = 2™"^. We say that this set is a set of 
orthogonal permutations if for any i,] G [0, fc — 1] 



|/,(x,)n//x,)| 

2"!— 1 



hi' 



where ^, ^ is the Kronecker delta. Let {e/jj^j be the standard 
vector basis of F^ and set eg to be the zero vector. The 
following theorem constructs a set of orthogonal permutations 
of size m + 1. 

Theorems The permutations /o, ...,/m ^nd sets Xg, Xm 
constructed by the vectors {e/ jj'ig and Construction\l\ where 
Xo is modified to be Xq = {x G : x ■ (1, 1, 1) = 0} is 
a set of orthogonal permutations. Moreover the (m + 3, m + 1) 
MDS array code of array size 2™ x (m + 3) defined by these 
permutations has optimal ratio of j. Hence, R{m + 1) = j. 

Proof: Since |Be;\Be, | = 1 for any / 7^ 7^ we get by 
lemma [7] 

/,-(x,)n/^-(x,-) = 0. 

Note that 

fi{{xi,X2,...Xnt)) = ( 



and 



thus 



X/ = {(xi,X2,...,x,„) gF^" :x, = 0} 



/,(X,) n/o(X,) = {x G F^' : Xi = 1} n {x G F^" : x,- = 0} = 0. 
Moreover, 

/o(Xo)n/KXo) 

m m 

= {x G : ^x, = 0} n {x G F^ : ^x, = 1} 

;=1 i=l 



Hence the permutations /q, . . . , /,« are orthogonal permuta- 
tions and the ratio is 1 /2 by Theorem |6] ■ 
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Figures. (a) The set of orthogonal permutations as in Theorem |8] with 
sets Xo = {0,3},Xi = {0,1},X2 = {0,2}. (f;) A (5,3) MDS an-ay code 
generated by the orthogonal permutations. The first parity column C3 is the 
row sum and the second parity column C4 is generated by the zigzags. For 
example, zigzag Zq contains the elements that satisfy fj{i) = 0. 

Actually this set of orthogonal pemutations is optimal in 
size, as the following theorem suggests. 

Theorem 9 Let F be an orthogonal set of permutations over the 
integers [0, 2'" — 1] , then the size of F is at most m + 1. 

Proof: We will prove it by induction on m. For m = 
there is nothing to prove. Let F = {fo, fi/ fk-i} be 
a set of orthogonal permutations over the set [0,2"' — 1]. 
We only need to show that |F| = k < m + 1. It is 
trivial to see that for any g,h E S2"' the set hFg = 
{hfog,hfig, ...,hfi^_lg} is also a set of orthogonal per- 
mutations with sets g~^{XQ),g^^{Xi),...,g^^{X^_i). Thus 
w.l.o.g. we can assume that /q is the identity permutation and 
Xq = [0,2™"^ — 1]. From the orthogonality we get that 



ullMXo) = Xo = [2'-\2 



Assume the 



Note that for any z 7^ 0, | X; n Xq | = = 2 
contrary, thus w.l.o.g we can assume that |X;nXo| > 2"'^^ 
otherwise |X; n Xo| > 2™"-^. For any 7^ / 7^ we get that 



/;(x,-nXo),/,-(x,nXo) cxo, 
|/y(x,-nXo)| = |/,(x,-nXo|)>2-2 = 



iXn 



(6) 



(7) 



From equations (|6|l and O we conclude that fj{Xj n Xg) fl 
//(X, n Xq) 7^ 0, which contradicts the orthogonality prop- 
erty. Define the set of permutations F* = {/flfZj over the 
set of integers [0,2™"!-!] by/*(x) = fi{x) - 2'"-^, which 
is a set of orthogonal permutations with sets {X; D Xg}*^^^. 
By induction k — 1 < m and the result follows. ■ 
The above theorem implies that any {k + 2, k) systematic 
MDS array code of size 2"' x (fc + 2) with optimal update and 
ratio J, satisfies k < m + 1. Notice that the code in Theorem 
|8] achieves the upper bound, i.e. k = m + 1. 

Example 2 Let A be an array of size 4x3. We construct a 
(5,3) MDS array code for A as in Theorem^ that accesses 
2 of the remaining information in the array to rebuild any 
systematic node (see Figure^. For example, Xi = {0, 1}, and 
for rebuilding of node 1 (column Ci) we access the elements 



ro = «o,o + flo,i + «o,2 

n = «i,o + + «i,2 

2^/1(2) = Zq = flo,0 + 2fl2,l 
^/i(3) = 21 = ai,o + 2fl3,i 



■ 2fll,2 

■ «0,2- 



It is trivial to rebuild node 1 from the accessed information. 
Note that each of the surviving node accesses exactly j of 
its elements. It can be easily verified that the other systematic 
nodes can be rebuilt the same way. Rebuilding a parity node is 
easily done by accessing all the information elements. 

IV. Code Duplication 

In this section, we are going to increase the number of 
columns in the constructed {k + 2, k) MDS codes with array 
size 2'" X (/c + 2), such that k > m + 1 and ratio is approxi- 
mately J. 

Let C be a {k + 2,k) array code where the zigzag sets 
{Z;}|'^J are defined by the set of permutations {//}*^q Q Sp 
and p is the number of rows in the array. For an integer s, an 
s-duplication code C' is an {sk + 2, sk) MDS code with zigzag 
permutations defined by duplicating the k permutations s times 
each. Moreover, the first parity column is the row sums. The 
coefficients in the parities may be different from the code C. 
For an s-duplication code, denote the column corresponding to 
the t-th fj as column < t < s — 1. Call the columns 
/ G [0, A: — 1], the t-th copy of the original code. An example 
of a 2-duplication code of the code in Figure |3] is illustrated 
in Figure |4] 

Theorem 10 If a {k + 2,k) code C has ratio R{C), then its s- 
duplication code C' has ratio atmostR{C){l + ^^)- 

Proof: Suppose in the rebuilding algorithm of C, for 
column i, elements of rows / = {71,72/ ■•■ //«} are rebuilt 
by zigzags, and the rest by rows. In C', all the s columns 
corresponding to /; are rebuilt in the same way: the elements 
in rows / are rebuilt by zigzags. The rebuilding ratio of this 
algorithm is obviously an upper bound of R{C'). W.l.o.g. 
assume column i^^^ is erased, then the average (over all 
i G [0, A: — 1]) of the number of elements needed to access 
in columns for all I ^ i,l E [0,k — 1] is 

R{C)v{k + l)-p, 

for any f G [0, s — 1]. Here the term — p corresponds to the 
access of the parity nodes in C. Moreover, we need to access 
all the elements in columns z'('',0 < f < s — 1, and access p 
elements in the two parity columns. Therefore, the rebuilding 
ratio is 



R{C') < 



s{R{C)p{k +l)-p) + {s-l)p + p 



p{sk + l) 



s{k + l) 
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Figure 4. A 2-duplication of the code in Figure [5] The code has 6 information nodes and 2 parity nodes. The ratio is 4/7. 



Corollary 11 The s-duplication of the code in Theorem^has 



ratio 2(1 



s(m+l)+l 



) , which is J 



2(m+i; 



for large s. 



When s = 1, the code C in Theorem |8] matches the lower 
bound of 1/2, but when s > 2, the ratio is greater than the 
lower bound. 

By Theorem[TT]we know that dupUcation of an optimal code 
has rebuilding ratio a little more than 1/2. In fact, removing 
the first information column (corresponding to the identity /g), 
the code in Theorem [8] has m information columns, 2™ rows, 
and the ratio is 1/2 + 1/ (2m) for large s. In general, given 
2™ rows, we can construct a {k + 2, k) code by k vectors 
{vq, - ■ ■, c/c-i } as in Construction [T] with some ratio r. And its 
s-duplication has ratio r' = (1 + j)r for large s. Is it possible 
that k is much larger than m and r is very close to 1/2, such 
that r' < 1/2 + 1/ (2m)? For example, if we can find a code 
with k = information columns, and r = 1/2 + 1/ [m^-^), 
then its s-dupUcation has ratio r' = (1 + l/m^)r w 1/2 + 
0{l/m^-^) for large s. And this code would have better ratio 
than Corollary [TT] In the following, we are going to show that 
such codes do not exist. Actually the construction made by 
the standard vector basis is optimal. 

Let D = D{y,E) be a directed graph with set of vertices 
V and directed edges £. Let S and T be two disjoint subsets 
of vertices, we define the density of the set S to be 

£ 

and the density between S and T to be dg j 



Es 
W 

where 



2 siir 



Eg is the number of edges with both of its endpoints in S, and 
Eg 7 is the number of edges incident with a vertex in S and 
a vertex in T. The following technical lemma will help us to 
prove the optimality of our constmction. 



Lemma 12 Let D = D{V,E) be a directed graph and S, T be 
subsets ofV, such that S HT = (Z),S L)T = V. We have: 

(i) Ifds T < T^SLX {ds,dj}, then dy < max{dg,dj}. 

(ii) If do, J > max {dg, dj], then dy < d^j. 

Proof: Note that dy = — — ' ' ^y^^ ' " ' ' . W.l.o.g 



assume that rfg > dj therefore 

\S\^ds + \T\^dT + 2\S\\T\dsT 
"^^^ \V[^ 

_ iSpdg + \T\^ds - \T\^ds + |T|2dj. + 2|S||r|ds 

_ ds{\S\ + \T\)^-\T\^{ds-dT) 

<ds. 

If > max{ds,dj} then 

\S\^ds + \T\^dT + 2\S\\T\ds,T 



di 



< 



|y|2 

\S\'^ds,T + \T\^ds,T + 2|S| |T|dg,r 



Thus the result follows. ■ 
Define the directed graph D,„ = Dm{V,E) as V = {W : 
W C [m]}. We also view the vertices as binary vectors of 
length n where each subset W corresponds to its indicator 
vector By abuse of notation, W is used to represent both a 
subset of [m] and a binary vector of length m. There is a 
directed edge from to W2 if and only if |W2\Wi| is odd. 
Let H be an induced subgraph of Dm- We construct the code 
C{H) from the vetices of H by Construction [T] By Lemma |7] 
we know an edge from Wi to W2 in H means /w2(Xw2) H 
/wi(^W2) = ®' so only half information from the column 
corresponding to Wi is needed when we rebuild the column 
corresponding to W2. Then in an s-duplication of the code, 
when we rebuild a column in the z'-th copy, the average ratio 
accessed in the ;-th copy would be 

\V{H)\^2"' 



|E(H)|2 



m— 1 



E(H) 



2 



|y(H)|22™ 2|y(H)|2 

for i 7^ y, which equals the rebuilding ratio of the s-duplication 
code for large s. Namely, 

limR(C')=l-^. (8) 

The following theorem states that the ratio in Corollary [TT] 
of the s-duplication of the code from the standard basis is 



optimal among all codes constructed by binary vectors and 
duplication. 

Theorem 13 For any induced subgraph H of Dm, dn < 
Therefore, for fixed m and large s, the ratio 2 (1 + ^ ) optimal 
among all s-duplication codes constructed by binary vectors of 
length m and Construction^ 

Proof: We say that a binary vector is an even (or odd) 
vector if IIW2II1 is (or 1). For two binary vectors Wi, W2, 
|W2\Wi| being odd is equivalent to 

1 = W2 ■ wr= W2(i+ Wi) = 11W2II1 + W2 ■ Wi 

When W2 is odd, this means W2 Wj =0 or Wi, W2 are orthog- 
onal vectors. When W2 is even, Wi,W2 are not orthogonal. 
Hence, one can check that when and W2 are both odd 
(or even), there is either no edge or 2 edges between them. 
Moreover, when Wj is odd and W2 is even, exactly one of the 
following is true: |W2\Wi| is odd, or |Wi\W2| is odd. Thus 
we have exactly one edge between and W2. 

Assume that there exist subgraphs of Dm with density 
higher than Let H be such a subgraph with maximal 

density. Let S and T be the set of even and odd vectors of H 
respectively. Note that dg 7 = j, because between any even 
and odd vertices there is exactly one directed edge. It is trivial 
to check that if maxjds/Sj} < j then < \ which leads 
to a contradiction. However if max {d^,Sj} > \ = dgj then 
by Lemma [12] we get a contradiction for the assumption that 
H is the subgraph with the maximal density. Thus H contains 
only odd vectors or even vectors. Let V{H) = {vi, ...,Vi^}, 
and assume that the dimension of the subspace spanned by 
these vectors in F™ is / where Vi,V2,-.-Vi are basis for it. 
Define S = {vi, ...Vi},T = u/^}. The following two 

cases show that the density can not be higher than 

H contains only odd vectors: Let u E T. Since u G 
span(S) there is at least one v E S such that u ■ v ^ and 
thus {u,v), {v,u) i E{H), therefore the number of directed 
edges between u and S is at most 2{l — 1) for all u E T, 
which means dg j < '-j^ < -^^^ and we get a contradiction 
by Lemma [T2l 

H contains only even vectors: Since the c,'s are even the 
dimension of span(S) is at most m — 1 (since for example 
(1,0, ...,0) ^ span(S)) thus / < m - 1. Let H* be the induced 
subgraph of D,„+i with vertices V{H*) = {(1, |z7/ E 
V{H))}. It is easy to see that all the vectors of H* are odd, 
{{l,Vi),{l,Vj) E E(H*) if and only if {vi,Vj) E E(H) and 
the dimension of span(y(H*)) is at most Z + 1 < m. By the 
case already proven for odd vectors we conclude that 



dn = dn* 



< 

< 



dim(span(y(H*))) - 

dim(span(y(H*))) 

l + l-l 

l + l 
m — 1 



m 



Therefore, the asymptotic optimal rebuilding ratio for du- 
plication codes generated by binary vectors is 2(1 + -J;-) by 
dill ■ 

V. Finite Field Size of a Code 

We have already shown that the code C in Theorem[8]has the 
best ratio, and if the code is over some big enough finite field 
F, it is MDS. In this section, we will discuss in more detail 
about the field size such that the code C or its s-duplication 
C is MDS. 

Consider the code C constructed by Theorem |8] Let the 
information in row /, column j be E F. Let its row 
coefficient be a,y and zigzag coefficient be [ij j. For a row 
set Ru = {au,o,iiu,l, ■ ■ ■ ,^u,k}^ the row parity is r„ = 
T!j=o'^u,jau,j- For a zigzag set Z„ = {fl;Q,0/«/i,l/ ■ • ■ / fl;^,/c}, 
the zigzag parity is z„ = E}=o ^Ij.j^lj.j- 

The (m + 3,77? + 1) code is MDS if we can recover the 
information when 1 or 2 columns are erased. It is easy to do 
rebuilding when 1 or information column is erased. And 
a/y, /5/y should be non-zero if one information column and 
one parity column are erased. For now, we will assume that 

y = 1 for all 

Construction 2 For the code C in Theorem [8] over F3, define 
Ui = YL\=Q for < i < m. Assign row coefficients as OLj^j = 
1 for all i, j, and zigzag coefficients as 



hi 



2, ifuj-i = l 
1, otherwise. 



where i = (fj, . . . , im) is represented in binary and the opera- 
tions are done over F2. 

Theorem 14 Construction^is an (m + 3, m + 1) MDS code. 

Proof: In an erasure of two systematic columns i,j E 
[0,m],i < j, we access the entire remaining information in 
the array. Set r' = r + e, ■ + Sj, and recall that a^^i e Z; if Z = 
r + e,, thus ay i,ayi j E ^r^e^ and ay^yUyi^^ E Zr+Cj- From the 
two parity columns we need to solve the following equations 
(for some 1/1,1/2/1/3/1/4 £ F3) 



1 


1 














V\ 








1 


1 








yi 










h,j 








ys 


















. y4 



This set of equations is solvable if 

For columns < i < j < m and rows r, r' defined above. 



(9) 



M/ ■ r + Ui ■ r' = Ui ■{r + r') = Y, {^i + ^j) 

1=0 

where the calculations are done modulo 2. And 



1, 



and get a contradiction by Lemma [T2l 



Uj - r + Uj ■ r 



Uj-{r + r') = Y,ei{ei + ei) = e] 
1=0 



Thus fiy j 7^ /5,,/ and j = ^yi j. Note that each of the j6's is 
either 1 or 2, so ^ is satisfied and F3 ensures the code to be 
MDS. ■ 

It is clear that F2 does not suffice for an MDS code, so 
3 is the optimal field size. The coefficients in Figure |3] are 
assigned by Construction |2] 

For s-duplication code C in Theorem [TT] denote the coef- 
ficients for the element in row / and column f by aj^' and 

|S|y^, < f < s — 1. Let be a field of size q, and suppose its 

elements are {Q,a^,a^,. . .,a^^^} for some primitive element 
a. 

Constructions For the s-dupUcation code C in Theorem [TTI 
over Wq, assign = 1 for all i, j, t. For odd q,lets < cj — 1 
and assign for allt^ [0, s — 1] 



a', 



if Uj ■ i 
o.w. 



where Uj = E^^g Poreven q (power of 2), lets < q — 2 and 
assign for all f G [0, s — 1] 

a~^~^, if Uj ■ i = 1 



o.w. 



Notice that the coefficients in each duplication has the same 
pattern as Construction |2] except that values 1 and 2 are 
replaced by a' and a*^^ if q is odd (or and a^*~^ if 
q is even). 

Theorem 15 Construction^ is an (s(m + 1) + 2,s(;7; + 1)) 
MDS code. 

Proof: For the two elements in columns f('i), /(^z) and row 
r, ti 7^ t2, we can see that they are in the same row set and 
the same zigzag set. The corresponding two equations from the 
two parity elements are linearly independent if f>^^f* 7^ f^'^rf ^ 
which is satisfied by the construction. 

For the four elements in columns f('i),y('2) and rows r, r' = 
r + ei + ej, < ti,t2 ^ s — 1, < i < j < m, the code is 
MDS if 



By proof of Theorem [TS] we know that jSj^^?' 7^ /^|.'V' ^^'^ 
f^^rf ~ ^r' ~ some X. When q is odd, 

for any x and fi. When q is even. 



xih) 



for any ti and 1 < x < q — 2 (mod q — 1). And by construc- 
tion, X = t2 + 1 01 X = —t2 — 1 for < t2 < s — 1 < ij' — 3, 
sol<x<ij' — 2 (mod q — 1). Hence, the construction is 
MDS. ■ 

Theorem 16 For an MDS s-duplication code, we need a finite 
field Wq of size q > s + 1. Therefore, Theorem [TSl is optimal 
for odd q. 



Proof: Consider the two information elements in row i 
and columns which are in the same row and zigzag 

sets, for 7^ f2 e [0,s - 1]. The code is MDS only if 

has full rank. All the coefficients are nonzero (consider 
erasing a parity column and a systematic column). Thus, 
{^\f)-^^\f + {4fr^^^!-\ and (jx^h^^^] are distinct 
nonzero elements in Fg, for z G [0, s — IJ. So > s + 1. ■ 

The coefficients in Figure |4] are assigned as Construction |3] 
and F3 is used. One can check that any two column erases 
can be rebuilt in this code. 

Consider for example of an s-duplication code with m = 10 
of the code in Theorem[8] the array is of size 1024 x (lis + 2). 
For s = 2 and s = 6, the ratio is 0.522 and 0.537 by Corollary 
nn the code length is 24 and 68, and the field size needed can 
be 4 and 8 by Theorem [15] respectively. Both of these two 
codes are suitable for practical applications. 

As noted before the optimal construction yields to a ratio 
ofl/2 + l/ mby using duplication of the code. However the 
field size is a linear function of the number of duplication of 
the code. Is it possible to extend the number of columns in 
the code while using a constant field size? We know how to 
have 0{nr') columns by using O(m^) duplications, however, 
the field size is 0{np-). 

The following code construction is a modification of Exam- 
ple [T] and uses only the field of size 9. The rebuilding ratio 
is asymptotically optimal, namely, l/2 + 0(l/m). Let the 
elements of F9 be {Q,a^ , . . . ,a^} where a is a primitive 
element. 



Construction 4 Let ?>\m, and consider the following set of 
vectors S C F™; for each vector v = {vi, . . . ,v„i) E S, 
\\v\\i = 3 and Vi,Vj,vi = 1 for some i e [l,;7;/3],; E 
[m/3 + 1,2111/3], I E [2m/3 + l,m]. For simplicity, we write 
V = Construct the {k + 2,k) code as in Construction 

[T] using the set of vectors S, hence k = \S\ = (y)'^ = 
^. For i E [l,m/3], define Mi = Et=l ^t- Similarly, for 
i E [m/3 + l,2m/3], define Mj = Y^\^^^^^^et and for 



I E [lm/3 + l,m], let Mi 
3 X m matrix 



■^t=2m/3+ 



^ gf. Then define a 



Mv 



Mj 



Mj 



Mj 



forv = {i,], /}, and MJ is the transpose of Mi. Assign the row 
coefficients as 1 and the zigzag coefficient for row r column 
c{v) as a* , where t = rMi, (in its binary expansion) and a is the 
primitive element. 

For example, let m = 6, and v = {1,4,6} = 
(1,0,0,1,0,1) E S. The corresponding matrix is 

T 



For row r = 26 = (0, 1, 1, 0, 1, 0), we have 
t = rMv = (0,1,1) = 3, 
and the zigzag coefficient is a^. 

Theorem 17 Construction^ is a {k + 2,k) MDS code with 
array size 2"' x {k + 2) and k = m'^/27. Moreover, the 
rebuilding ratio is j + for large m. 

Proof: For each vector v E S, there are 3{m/3 — 1)'^ 
vectors m G S such that they have one 1 in the same location 
as V, i.e. |By\B„| = 2. Hence by Theorem |6] and Lemma |7] 
for large m the ratio is 

1 ^ 3((f)-l)^ ^l ^ 9 

2 2(-^ + l) ~2 2m 

Now we show that the code is indeed MDS. Con- 
sider columns c{u),c{v) for some u = {h,ji,h} 7^ 
V = {h,i2'h} and /i,Z2 e [l,m/3],;i,;2 e [m/3 + 
l,2m/3],Zi,Z2 G [2m/3 + l,m]. And consider rows r and 
r' = r + u + V. The same as proof of Theorem [14] we know 
that the 4 elements in these rows and columns are in two row 
sets and two zigzag sets. Moreover, we know from that proof 
that if li 7^ I2, say li < I2, then 

rMl 7^ r'Ml (10) 

and 

rMl = r'Ml. (11) 

If h = then 

rMl = '"'■^^ = ''^'^ = ^'^h 

And similar results hold for ii,i2 and /i,72- Next we check if 
(|9]l is satisfied, which is equivalent to check 

rMl + r'Ml ^ rMj, + r'Mj mod 8 (13) 

by Construction |4] Here we view each vector of length 3 as 
an integer in [0,7] and the addition is ordinary addition of 
integers instead of bitwise XOR. 

We do the addition in ( fT3] l from the Isb (least significant 
bit) to the msb (most significant bit). If the sum in a bit is 
more than 1, the carry is 1 in the next bit. Notice that mod 8 
only changes the fourth bit and will not affect the first 3 bits. 

If li 7^ I2, the Isb in the sum in ( fT3] l are different on the two 
sides by (fTOl i ( fTTT i. and we are done. Assume li = I2, by (fT2] i 
the Isb in the sum in ( fT3] l are on both sides, and the carry 
in the second bit are equal. In this case, if ji 7^ j2- since the 
carry are the same, by similar reason (ITji holds. If /i = i2, 
again the second bit are equal and the carry in the third bit 
are equal. In the latter case z'l 7^ Z2 since u 7^ v, and we have 
the third bit are different. Therefore, (ITJi is satisfied for any 
two vectors in T, and the code is MDS. ■ 

Notice that if we do mod 15 in ( fT3] l instead of mod 8, 
the proof still follows because 15 is greater than the largest 
possible sum in the equation. Therefore, a field of size 16 is 



also sufficient to construct an MDS code, and it is easier to 
implement in a storage system. 

Suppose IV < m and w\m. Construction|4]can be easily gen- 
eralized to the set of vectors T, such that any c G T satisfies 
= zf, Vi. = 1, for some ij E [jm/iv + 1, [j + l)m/w], 
j E [0,...,w -1]. The {k + 2, k) MDS code has array size 
2"' X (fc + 2) with 

ni 

k = {-r, 

w 

and uses a finite field of size 

q = 2"' + l 

if 2™ + 1 is a power of a prime. In all cases, q = 2™+^ suffices 
to make the code MDS. Moreover, when zv is odd and small 
compared to m, the ratio is 

1 

2^ 2m 

for large m. 

VI. Decoding of the Codes 

In this section, we will discuss decoding algorithms of the 
constructed codes in case of column erasures as well as a 
column error The algorithms work for both Construction [l] 
and its duplication code. 

Let C be a (fc + 2, k) MDS array code defined by Con- 
struction [T] (and possibly duplication). The code has array size 
2"' X {k + 2). Let the zigzag permutations be fj, j E [0, fc — 1], 
which are not necessarily distinct. Let the information ele- 
ments be Uj j, and the row and zigzag parity elements be r, 
and z,, respectively, for i E [0,2"' - 1],] E [0,k-l]. W.l.o.g. 
assume the row coefficients are a, j = 1 for all And let 
the zigzag coefficients be /3, y in some finite field F. 

The following is a summary of the erasure decoding algo- 
rithms mentioned in the previous sections. 

Algorithm 1 (Erasure Decoding) 
One erasure. 

1 ) One parity node is erased. Rebuild the row parity by 

k-l 

j=0 

and the zigzag parity by 

k~l 

^'=Lf^fr^i.)ffrHi),r (15) 
;=0 ' ' 

2) One information node j is erased. Rebuild the elements in 
rows Xj (see Construction\Vi) by rows, and those in rows Xj by 
zigzags. 

Two erasures. 

1 ) Two parity nodes are erased. Rebuild by (fT4l i and ( fTSl l. 

2) One parity node and one information node is erased. If the 
row parity node is erased, rebuild by zigzags; otherwise rebuild 
by rows. 



3) Two information node ji and ;2 are erased. 
- Iffj^ = /.^, for any i E [0,2"' - 1], compute 



E 



Solve UiL, Uji^ from the equations 



" 1 1 ■ 














. Hj2 . 







- Else, for any i e [0,2™ - 1], set i' = i + fj^{Q) + 
fj^ (0), and compute X;, Xii,yi, y,/ according to ( fT6l l. Then solve 
ai j^,ai j^, aji j^,ajr j^ from equations 
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. Vi' . 



In case of a column error, we first compute the syndrome, 
then locate the error position, and at last correct the error. 
Let Xq,Xi, . . . ,Xp^i G F. Denote /^^ (xg, Xj, = 
(xyr-i(Qj,Xyr-i(2j, . . .,Xyr-i^p_2^) for a permutation / on 
[0, p — 1]. The detailed algorithm is as follows. 

Algorithm 2 (Error Decoding) 
Compute for all i e [0, 2™ - 1] ; 

Si,o = E "hi ~ 

M 

% = L/fr^(i)ffr\i),j-^i 
Let the syndrome be Sq = (So,0/ Si,0/ • ■ • / S2n'_i,o) and Si = 

(So,l/Sl,l, . . .,S2m-l,l). 

- if So =0 and Si = 0, there is no error 

- Else if one of Sq, Si is 0, there is an error in the parity. Correct 
it by (Ell or 

- Else, find the error location. For j = tok — 1: 

Compute for all i e [0,2™ - 1], x,- = |6gS,;o. 
Let X = (xo, . . . , X2.«_i) and Y = /r^ (X). 
Jf y = Si, subtract Sq from column j. Stop. 
If no such j is found, there are more than one error 

If there is only one error, the above algorithm is guaranteed 
to find the error location and correct it, since the code is MDS. 
If the computations are done in parallel for all i G [0,2'" — 1], 
then this algorithm can be done in time 0{k). Moreover, 
since the permutations /,'s only change one bit of a number 
in [0,2"' — 1] in Theorem [8] the algorithm can be easily 
implemented. 



VII. Generalization of the code construction 

In this section we generalize Construction [T] into arbitrary 
number of parity nodes. Let n — k = r he the number of 
parity nodes, we will construct an {n,k) MDS array code, 
i.e., it can recover up to r nodes erasures for arbitrary integers 
n, k. We assume that each systematic node stores ^ of the 
information and is stored in columns [0,k — 1]. The f-th parity 
node is stored in column k + i, where < f < r — 1 and is 
comprised of zigzag sets {Zj}. 

Construction 5 Let T be a subset of vector of Zj," for some 
integer m, where for each 

V = {vi,...,Vm) e r,gcd(z;i,. ..,!;«,/-) = 1. (16) 

We will construct for each integer f, < i < r — 1 a set of 
permutations Bj = {fl : v E T} where each permutation 
acts on the set [0, r'" — 1]. By abuse of notations we use x 
both to represent the integer and its r-ary representation and 
all the calculations are done over Z^. Define /^(x) = x + iv, 
for example ifr = 3,x = 5,i = 2,v= (0, 1) 

/(\,) (5) =5 + 2(0,1) = (1,1) + (0,2) = (1,0) = 3. 

We define the zigzag in parity node s,0 < s < r — 1 and 
m G [0, r'" — 1], as the linear combination of the elements aj^j 
such that their coordinates satisfy 



m. 



Note that these parity nodes described now, form an (n, k) MDS 
array code under appropriate selection of the coefficients in 
the linear combinations of the zigzag. The proof follows the 
exact same lines as in Theorem [3] and therefore is omitted. 
In a rebuilding of systematic node i the elements in rows 
X- = {i? G [0, r'" — 1] : V ■ Vi = r — s} are rebuilt by parity 
node s where < s < r — 1. From ( fTSI ) we get that for any i 
ands, |X^| = /-'"-l. 

Let the information array be A = (fly) with size r'" x |T|. 
Denote by c(m) the column index corresponding to vector u G 
T. Define f^'{x) = x — iu, which is the inverse permutation 
of , < f < r — 1. For any x G [0, r'" — 1], the x-th zigzag 
set in the z'-th parity node is = ^ " G T}. In 

a rebuilding of column c{v), by Construction |5] we need to 
access the elements in the zigzag sets Z'^ for any x G fvi^'v) 
and i G [0,/" — 1]. Hence, in column c{u), u v, we access 
elements in rows 

<r — l r—i ri i 



(17) 



In order to get a low rebuilding ratio, we want that the union 
of ([TtI i over all the columns c{u),u v to be as small as 
possible. 

In particular, for a given u if /,7'/i>(XJ,) are identical for any 
< / < r — 1, we need only to access \fu'fv{Xl,)\ = r'"^^ 
elements from column c{u). Notice that in this case, for all 
< z < /- - 1, 



(18) 



simply because is the identity permutation for any vector 
V. We say the permutation set {//,}o</<r-l is orthogonal to 
the permutation set {/^}o<;<r-l if CSl holds for all i. 

In general, we want to know the relation of fu'fl{X[,), 
< i < r — 1. And the following lemma shows they are 
either identical or disjoint. 



Lemma 18 For any u, v, 

\LT'n{K)nfu'M)\ 



\x% {i-i)c,,u 
0, o. w. 







Where Ci,,„ = v ■ {v — u) — 1. In particular for j = we get 

/■0\ jrifviw _ J ifiCv,ii = 



\fl{x^)nfUK] 



0, 



o.w. 



Proof: Consider the group (Z"', +). Let e XJ, and 
fl{, e X{,. Note that X° is a subgroup of Z™ and XJ, = X° + 
is its coset. Hence the cosets fu'fvi^v) = + + '('^ ~ ^0 
and /i, 'fl{Xl) = + al + i{v — u) are either identical or 
disjoint. Moreover they are identical if and only if 



I.e. 



(i-i) 



-al + {i-j){v-u) eX°, 
and the result follows. 



Theorem 19 The permutations set {fj} and sets X|, for < 
I < m,0 < i < r — 1 constructed by the vectors {e(}"lg and 
Construction^where Xq is modified for any i tobeX'^ = {x E 
ZJ." : X ■ (1, 1, 1) = /}, is a set of orthogonal permutations. 
Moreover the (m + l + r, m + 1) MDS array code of array size 
r™ X {m + 1 + r) defined by these permutations has optimal 
ratio of J. 

Proof: For 1 < l,n < m, C]^„ = e; ■ (e; — en) — 1 = 0, so 
by Theorem [18] {/^}o<!<i--l is orthogonal to {//}o<Kr-l- 
For 1 < n < m, and all < / < r — 1, 



/^(X0)=x0 = {x:x„=0} 



and 



fni^'n) = f'niU ' = -?}) = U + i^n : X,, = -l} 
= {y ■■ Vn = 0} 

Therefore, {/Q}o<Kr-i is orthogonal to {fn}o<i<r-i- Simi- 
larly, 

fLiA) =mx:x-{l,...,l)=0}) = {x:x-{l,...,l) = 
and 

/^(Xj,)=X^ = {x:x.(l,...,l)=f} 

for all i. Thus, {/,^}o</<r-l is orthogonal to {/o}o<Kr-i- 
Therefore, in order to rebuild one systematic node, we access 
only r™"^ elements in each surviving node. Hence the ratio 
is 1/r. ■ 
The following theorem gives the ratio for any code of 
Construction |5] 



Theorem 20 The ratio for the code constructed by Construc- 
tion^and set of vectors T is 

1 , ITI 



;cd(r,Ciy,) 



|T|(|r| -1 + r) 
where gcd is the greatest common divisor 

Proof: By previous discussions and noticing that we 



access r'" ^ elements in each parity node, the ratio is 

\T\{\T\ - l + r)r'" 

From Lemma [18] and noticing that \{i : icy^u 
mod r}| = gcd(r, Cy^,,), we get 

-'"-1 xr/gcd(r,c„,„). 



(19) 
= 



\^\ZUu'mv)\-r" 

Thus, the theorem follows. ■ 
When Cy^u = for all v,ii e T, the ratio is 1/r and is 
optimal according to ([T]i. The next theorem states that the 
construction using standard basis has the most number of 
columns possible for an optimal ratio code. 

Theorem 21 For an orthogonal set of permutations 

{//}o<;<s-l,0</<r-l over the integers [Q,r^ - 1], 

and the corresponding sets {Xy\, i.e., for all 
< m,l < s - 1,0 < i < r - 1, 



fi,,{x^)=fi{xi) 



we have s < k + 1. 



Proof: We prove it by induction on k. When k = 0, 
s < 1. Notice that for permutations g,hi, < i < r — 1, 
the set of permutations {hiflg} are still orthogonal under 
the sets {^"^(XJ)}. This is because hif'^g{g-^{X^)) = 
hifUX^) = h.fiiXj) = h,flg{g-\X\)). Therefore, we 

• " 1], and 



can assume w.l.o.g. Xq = [fr ^,(/ + l)r' 
is identity, for < z < r — 1. 

Since //(Xg) = /,j(Xj,) = Xj, for all / 7^ 0, we have 
/4(X0n Xg),//(X j nXg) C X^. similarly, for all / ^ 0, 

/Kxg) = /Kxg) = and /;(xo n xg),//(x; n Xg) C 

Xj,. But /4(X0) = //(XJ), which implies 

/;(xOnxg)=//(xj-nxg). 

Thus the set of permutations {//}i</<s_i o<!<r-i over 
[Cr'^-l - 1], where //(x) = x- fr'^'-^'and sets' {Xj' n Xg} 
Oare orthogonal. And by induction, s — 1 < fc. ■ 
The next theorem gives the finite field size of a code with 
3 parities. 

Theorem 22 For r = 3 parity nodes the code constructed by 
the standard basis and the zero vector{e,}"^Q G together 
with Theorem \l9[ a Geld of size at most 2{m + 1) suffices to 
make it an (m + 4, m + 1) MDS code. 

Proof: Let be a field of size q > 2{m + l), and q is a 
prime. For any I E [0,m] let A; = (fl/j), be the permutation 
matrix defined by the permutation f}^ i.e. a,- ^ = 1 iff f}^ (j) = i. 



else fl/y = 0. Let a be a primitive element of the field F^, and Note that = {AjAj = a^^' ^'Z 7^ / since i — j < m < 
modify the nonzero entries of A; as follows, if a, y = 1 and ?-l j|gjj(,g 



j ■ ei = 0, modify it to y = a' 



We will show that under this assignment of coefficients the 
matrices A; commute, i.e. for any Zi 7^ Z2 G [^r^t], Ai^Ai^ = 
Ai^Ai^. For simplicity, write fe^^ = /i//e,, = fi ™d p = 



[a 



),Ai^ = (bj^j). For a vector x 



3'". And write A/j 

{xq, . . . , Xp^l) and y = xAi^, its ;-th entry satisfies 
fly^(-yj j^f-i{j) for all G [0, p — 1]. And by similar calculation, 
z = xA/jA;^ = i/A;^ will satisfy 



"f2{i),iyf2{i) - ^/2(i)//l(/2(/))-/2(/)^/l(/2(/))- 



Similarly, if zu = xAi^Ai^, then 



/ia)j^/2(/i(/)),/ia)^/2(/i(/))- 



Notice that 



Moreover, 



^fiifmUi)- Similarly, a^^yj^y = a^^^f^^j^^j^^jy 



/i(/2(/))=/2(/i(;)) 

Hence, Zy = for all / and 

xAi^Ai^ = 



IV 



xAi^Ai^ 



for all X eWf. 



Thus A; A; 



A;, A;, 



Note that f^^ = (/e, hence define the set of matrices Aj to 
be A^ = (A;)'^. The code is MDS if it can recover from loss 
of any 3 nodes. Hence With this assignment of coefficients 
the code is MDS iff any block submatrices of sizes 1 x 1, 2 x 
2, 3 X 3 of the matrix 



/ I 
Ao Ai 
A§ Al 



I 

Am 



are invertible. Let < f < 7 < fc < m we will see that the 
matrix 

/ I I 

A, Aj Ak 



Ai 



4 



is invertible. By Theorem 1 in ll20l and the fact that all the 
blocks in the matrix commute we get that the determinant 
equals to det(A)t - Ay) ■ det(Afc - A,) ■ det(Ay - A,). Hence 
we need to show that for any i > j that det(A; — Ay) 7^ 
which is equivalent to det{AiAJ^ — I) 7^ 0. Note that 
for any i, A^ = a'l. Denote by A = AjAj^, hence 
A3 = (A;A-l)3 = A^A-^ = a'-il 7^ 1. Therefore 

7^ det(A3 -I)= det(A - /) det(A2 + A + I). 

Therefore det(A - /) = det(AiArl - /) ^ 0. For the case 
when one of the three erased columns is a parity column it 
sufficient to check that for any / > / that 



det( 



I 

A? 



det(A-^) det(Af Ar2 - I) 7^ 0. 



7^ det(A^ -I)= det(A2 - I){A'^ + A^ + I), 

and det(A2 - ^) = det(A^Ar2 _ ^ q which concludes 
the proof. ■ 

VIII. Concluding Remarks 

In this paper, we described explicit constructions of the 
first known systematic {n,k) MDS array codes with n — k 
a constant (independent of «) and with an optimal rebuilding 
ratio. Specifically, the amount of information needed to rebuild 
an erased column equals to l/{n — k), which matches the 
information-theoretic lower bound. 

Here we provide additional observations and indicate a 
couple of open research problems. 

The write complexity: Consider the number of read accesses 
in the case of a write (update) operation. For example, in 
an array code with two redundancies, in order to update a 
single information element, one needs to read at least three 
times and write three times. The reason is that in order to 
compute the new parity element, we need to know the values 
of the current information element and the two current parity 
elements. However, in our optimal code construction with two 
redundancies, if we update all the information in column 1 and 
the rows in the first half of the array (see Figure |3), it is not 
necessary to read any information or parity elements, since 
we know the values of all the information elements needed 
for computing the parity. These information elements take 
about half the size of the entire array - the idea is to cache 
the information to be written until most of the corresponding 
elements need update (we could arrange the information in 
a way that these elements are often updated at the same 
time), and the number of reading operations compared to the 
information size is very small. Similarly we can use the same 
approach for any other systematic column. In general, given 
r redundancies, we could avoid read operations (in support of 
a write operation) if we make sure that we update about 1/r 
of the array at the same time. 

Lowering the rebuilding ratio: We note that one can add 
redundancy columns for the sake of lowering the rebuilding 
ratio. For instance, one can use three redundancy columns 
where the third column is not used for erasure correction - 
with three redundancy columns we need to access only 1/3 
of data instead of 1/2 in the rebuilding of a single failed 
column. 

Open problem: For a given array size, what is the rebuilding 
ratio for a code defined by arbitrary permutations? In Theo- 
rem [13] we showed that 1/2 + 1/A: is optimal for a code 
constructed by binary vectors and duplication. However, this 
ratio is not known for arbitrary permutations. 
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